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Abstract. The response to a knockout of a node is a characteristic 
feature of a networked dynamical system. Knockout resilience in the dy- 
namics of the remaining nodes is a sign of robustness. Here we study the 
effect of knockouts for binary state sequences and their implementations 
in terms of Boolean threshold networks. Beside random sequences with 
biologically plausible constraints, we analyze the cell cycle sequence of 
the species Saccharomyces cerevisiae and the Boolean networks imple- 
menting it. Comparing with an appropriate null model we do not find 
evidence that the yeast wildtype network is optimized for high knockout 
resilience. Our notion of knockout resilience weakly correlates with the 
size of the basin of attraction, which has also been considered a measure 
of robustness. 
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1 Introduction 

Living systems show an ubiquitous robustness against mutations, environmental 
changes and intrinsic non- determinism [1-3]. In particular, each single cell must 
control its growth and eventual division by regulating concentrations of proteins 
in a precise temporal pattern. Using a Boolean state dynamics [4], this cell 
cycle network has been argued to be robustly designed for budding yeast as 
a model organism [5]. The robustness has been pinpointed as reproducibility of 
the dynamics in the presence of stochastic perturbations [5-7] . Resilience against 
mutations, i.e. changes of the interactions among the proteins, has been studied 
as well [5,8,9]. 

A particular type of mutation, either intrinsic or by intervention, is a com- 
plete knockout of a single protein. Knockouts are a type of component failure 
often used in experiments to probe cellular functions. A gene is made dysfunc- 
tional such that it is no longer transcribed and its product is effectively removed 
from the cell. Some knockouts can be tolerated or compensated by the cell, 
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whereas others are lethal. Knockouts are often used to infer the function of spe- 
cific proteins. This is only appropriate, if the knockout is neither lethal nor fully 
compensated, but disables a specific function of the cell. Importantly in real ex- 
periments, knockout resilient systems are more difficult to analyze and identify 
because knockout mutants do not exhibit measurable difference. 

In this contribution we study the resilience against knockouts in two sce- 
narios. First, the system under consideration is defined only by a sequence of 
activation patterns regardless of the specific mechanism producing them ( "black 
box"). Here, resilience with respect to knockout of a node means that the infor- 
mation contained in the activation patterns of all other nodes is still sufficient 
to unambiguously produce the original sequence. In the second scenario, we con- 
sider the sequence together with a given implementation by a Boolean threshold 
network. Knocking out node j means that we remove its interactions with the 
other nodes. The network is resilient against this knockout if all other nodes still 
perform the original sequence of activation patterns. 

After defining these notions of knockouts and resilience we apply them to 
the yeast cell cycle sequence and its network implementations. Significance of 
the results is assessed by comparison of random sequences as null models with 
various constraints. 

2 Defining knockouts and robustness 

Molecular processes within cells are frequently modeled by Boolean networks [4, 
10, 11]. The nodes of the network correspond to different molecules which can be 
present in either high or low concentrations. Interactions between the molecules 
change these concentrations which leads to dynamics in discrete time steps of 
switching events. The dynamics is thus described as a temporal sequence of 
activation patterns x(0), x(l), . . . , where each activation pattern x has n binary 
components x^, i = 1, . . . ,n. Low and high concentration of molecule i at time 
t are denoted by Xi{t) = and Xi{t) = 1, respectively. Each node i computes a 
Boolean function fi mapping the present concentration pattern to its activation 
at the next time step, i.e. Xi(t + 1) = . . . ,Xn{t)). 

2.1 Robustness of function 

Let us first define what we mean by robustness of a single node i against knockout 
of another node j. We assume that the input nodes follow a certain dynamics 
. . . ,Xn{t), where t > 0. The set of possible activation patterns which the 
input takes is called the input support It can be represented as a set of binary 
strings of length n. 

Node i is robust against knockout of node j if i follows the same sequence 
even without the presence of node j. This means that the state information of 
node j is either irrelevant for i or already contained in the other available inputs. 

Definition: A node i with mapping fi : {0, 1}^ {0? 1} is robust against 
knockout of node j if Xi is independent of xj given xi, . . . , Xj+i, . . . x^. 
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With this we mean that 

whenever both (xi, . . . , 0, Xj+i, . . . x^) and (xi, . . . , 1, . . . Xn) lie 
in the input support. Robustness against simultaneous knockout of multiple 
nodes is defined in a similar fashion. 

It is important to note that this definition depends on the input support. If 
the input xi, . . . takes each possible value, then robustness against knock- 
out of node j implies that fi does not depend on Xj. However, if the inputs 
are highly correlated, then nontrivial robustness may appear: For example, it 
may be possible that Xi can compensate the knockout of a single input by re- 
constructing the missing information from another input with similar dynamics, 
but the simultaneous knockout of both inputs leads to a failure. 

Our notion of knockout robustness is based on ideas of Ay and Krakauer [12]. 
Within the framework defined there [12], resilience means that the exclusion 
dependence of the system with respect to certain knockouts vanishes. Combi- 
natorial conditions characterizing this situation have been found by Herzog et 
al. [13]. The theory becomes much simpler in the present setting restricted to 
deterministic dynamics. 

Our definition of robustness can be applied to each node of the cell cycle 
network shown in Figure 1 . In our studies of the yeast cell cycle the input support 
will be the set of activation patterns which appear in this cycle. We then study 
each node mapping and ask, which inputs are "essential" for the functioning of 
this node and which inputs can be compensated. 

In order to study robustness of the system as a whole there are multiple 
possibilities: As a measure of system robustness we count the number of nodes 
which are robust with respect to all single node knockouts, i.e. how many nodes 
would remain functional if any one input would be knocked out. Another pos- 
sibility is to find the knockouts under which the behavior of the whole system 
is robust. In this case we find the single node knockouts which can be compen- 
sated by all (other) nodes of the network. The set of these knockouts is called 
the resilience combination. The cardinality of the resilience combination is then 
used as a measure of knockout resilience. 

2.2 Linear threshold networks and robustness of implementation 

The analysis presented up to now just uses properties of the sequence of node 
activations and thus cannot distinguish between different protein interaction 
networks implementing this sequence. In order to apply our notion of robustness 
to example systems and investigate the effects of specific network structures we 
focus on linear threshold networks. Such a network is given as a directed graph 
among n nodes with weighted edges. We allow loops, i.e. edges starting and 
ending at the same node. For simplicity we only allow weights Wij = 1 and 
Wij = —1 for each directed edge i ^ j. If there is no edge from node j to node 
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i we write Wij = 0. To each node i we associate a time-dependent variable Xi{t) 
with dynamics given by 

if Y.jWijXj{t) > 0, 
Xi{t + 1) = < if J2j < 0, (2) 

^Xi{t) else. 

It is possible to refine this model by allowing more values for the edge weights. 
Furthermore it is customary to introduce threshold parameters for every node. 
Then the dynamics is determined by comparing WijXj{t) to this threshold. 
In this work we do not make use of these possibilities and restrict ourselves to 
the simple model. 

One important feature of the definition of robustness on page 2 is that it 
only depends on the values of the functions fi on the support. If we apply this 
definition to a Boolean network then, in a sense, the mechanisms at work are 
ignored, and only the sequence of activation patterns given by the dynamics 
plays a role. Therefore we will call this robustness of function. 

In the context of linear threshold models we can define a related notion of 
robustness, which we will call robustness of implementation: A knockout of a 
node j is modeled by removing a node from the network, together with all of 
the edges involving this node. We can then analyze the dynamics of the changed 
network and compare it to the dynamics of the unperturbed network. If the 
dynamics of the remaining n — 1 nodes is not changed, then we say that the 
implementation can compensate the knockout of node j. This notion allows to 
compare different linear threshold networks which implement the same activation 
sequence with respect to their knockout resilience. 



3 Analysis of the yeast cell cycle 

For the analysis the model describing the cell cycle of the yeast species Saccha- 
romyces cerevisiae, introduced by Li et al. [5], is used. It consists of 11 nodes 
(see Figure 1) which represent four different classes of molecules (cyclins tran- 
scription factors and the inhibitors, degraders, and competitors of the cyclin 
complexes). Those molecules are involved in the control of the cell cycle process 
and act as key regulators. 



3.1 Robustness of function 

We now apply the ideas of section 2.1 to the yeast cell cycle. Starting with the 
system robustness we find that only the four nodes CLN3, CLN1,2, CLB5,6 and 
CDC20 are robust against all single node knockouts^. Regarding the knockout 
resilience the cell cycle still functions correctly if any one of the five nodes MBF, 
SBF, MCMl/SFF, SWI5 or CDHl is knocked out. 



^ For CLN3 this is trivial since it corresponds to the constant map y = 0. 
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Fig. 1. The wildtype network of the cell cycle of the yeast species Saccharomyces 
cerevisiae [5]. The edges of the network are directed and can be activating (green 
arrow) or inhibiting (red arrow). All self-couplings (yellow) are inhibiting. The network 
comprises 34 different interactions, 15 of which are activating and 19 are inhibiting. 
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Table 1. The cell cycle sequence of the yeast species Saccharomyces cerevisiae [5]. 
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Fig. 2. {a — b) The number of inputs which can be compensated by all other node 
are shown for the first null model (a) and the more realistic null model (b). Blue bars 
represent networks with random cell cycles and the green bar represents the knockout 
resilience of the wildtype network, (c — d) The number of nodes which are robust with 
respect to all single node knockouts for the first null model (c) and the more realistic null 
model (d) are shown. The blue bars represent networks with a random cycle sequence 
while the green bar shows the system robustness for the wildtype network itself. Note 
the logarithmic scale of the four diagrams. 
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We can now ask whether these findings are a special property of the cell 
cycle or rather expected for a support containing 13 out of 2^^ = 2048 possible 
sequences. In order to answer this question we need to define a null model of 
sequences which corresponds to a cell cycle. A distinguished feature of the cell 
cycle is that it can be triggered by a single signal protein, i.e. the first and 
last activation pattern only differs by the activity of a single node. As a first 
test we generated random activation sequences of 11 nodes where each node is 
active with probability |. The initial activations were taken as the last activation 
pattern with a change in a single node activation. Cycles containing repeated 
node activation patterns were discarded and redrawn. 

Figure 2 (a) shows the fraction of random cycles with a certain knockout 
resilience. The five nodes of the yeast cell cycle are untypical compared to this 
null model, i.e. the yeast can compensate less knockouts than expected. The 
system robustness of the random cycles is shown in Figure 2 (c). With respect 
to this measure the four robust nodes of the yeast cell cycle are rather typical. 

Does this mean that the yeast cell cycle is not particularly robust against 
knockouts? A closer look at the cell cycle reveals that its activation pattern is 
far from random. Another null model which more closely resembles the observed 
activation pattern is based on the following properties: 

— CLN3 acts as an input node. Its activation starts the cycle and it remains 
silent throughout the sequence. 

— The other nodes do not switch their activations randomly during the cycle, 
but are activated and inactivated exactly once during the cycle. Their ac- 
tivity (or inactivity) is therefore constrained to a block of successive time 
steps. 

Therefore we consider a second null model of random cycles obeying these two 
properties. Each node, except node CLN3 with a fixed sequence, switches its 
activation at two randomly drawn time steps within the cycle. As before, cycles 
with repeated activation patterns were discarded. 

Figure 2 {b) and {d) show the robustness and knockout resilience for random 
cycles from this null model. Now the yeast cell cycle shows typical values for both 
considered statistics. This suggests that the yeast cell might not be optimized 
for high robustness and knockout resilience, but just happens to exhibit these 
properties due to purely statistical reasons. 

3.2 Robustness of implementation 

To further understand to what extent knockout resilience played a role for the 
evolutionary design of the cell cycle, we investigated its implementation as linear 
threshold networks. For the knockout experiment we looked at all networks which 
are capable of reproducing the cell cycle sequence (see Table 1). The set of all 
such networks can be considered as the set of vertices of the neutral graph. Two 
networks are then connected by an undirected edge if they differ by a single 
mutation. In our case a mutation corresponds to a addition or deletion of an 
edge or the change of a weight in the interaction network [9, 14]. 
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Table 2. Resilience combinations and their abundance among networks implementing 
the cell cycle sequence. All components refers to unconstrained implementations while 
those in the wildtype component are reachable from the wildtype by one or several 
mutations. 
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Table 3. The average basin sizes for the different knockout resiliences in the wildtype 
component and all components are shown. 
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Fig. 3. The three diagrams show different aspects of the knockout experiment. One 
column represents one specific resilience combination shown on the bottom, (a) The 
distribution of the specific knockout schemes is shown. The green bars represent the 
wildtype component. The blue bars represent all components and the single red bar 
represents the resilience combination of the wildtype itself. Altogether 24 different 
resilience combinations were found, 8 of which are only present in the wildtype compo- 
nent, (b) The distribution of positive, negative and all edges for each of the correspond- 
ing resilience combinations are shown. For each resilience combination 10^ network were 
sampled. The darker lines with the squares represent networks from all components, 
while the brighter lines with diamonds represent networks only reachable by mutations 
from the wildtype. (c) For each knockout scheme 10^ different networks were sampled. 
The averages of the basin sizes corresponding to the specific resilience combination are 
shown. The blue line with squares represents the networks from all components, while 
the green line with a diamond represents networks only from the wildtype component. 
Additionally, the area between the 10% and 90% quantiles is inked. The resilience 
combinations are ordered such that the average basin size increases. 
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As shown in [9] the neutral graph decomposes into many different connected 
components. This is why we compare the wildtype network with the wildtype 
component as well as the whole neutral graph. For each network we checked 
if each possible single node knockout is changing the cell cycle sequence (see 
Table 1) or if it has no influence on it. We found 24 different occurring resilience 
combinations. Out of these only eight were found in the component where the 
wildtype is situated (see Table 2). 

The majority of networks, i.e. ^ 99% of all the functional networks of the 
neutral graph and ~ 90% of the networks in the wildtype component, cannot 
cope with any single node knockout. However, a maximum of four independent 
single node knockouts were found. The wildtype itself is found to be only capable 
of coping with the single node knockout of the node CDHl^ see Figure 3 (a) 
and networks reachable by a mutational path from the wildtype can manage at 
maximum three independent knockouts. 

One might speculate that high knockout resilience requires redundant wiring 
of the network which would be observable as an increased edge density. In Figure 
3 {h) we look at the distribution of the number of positive, negative and all edges 
for the different resilience combinations. There is no clear correlation between 
the average number of edges and the knockout resilience. 

As suggested by Li et al. [5], the basin of attraction of the Gl fixed point 
(stationary state) is a measure of robustness. The basin consists of all activation 
patterns from which the Gl state is eventually reached by following the dynamics 
(2). The average basin size of networks with a given resilience combination is 
shown in Figure 3 (c). In Table 3 the average basin size for the different knockout 
resiliences is shown. With an increasing capability to cope with more single node 
knockouts the average basin size increases. Additionally, average basins sizes in 
the wildtype component are larger then their corresponding basin sizes in all 
components. However, all network implementations in the wildtype component 
have a network resilience of at most three. 



4 Discussion 

We have studied dynamics with Boolean state vectors (activation patterns) un- 
der knockout of single components, being the nodes of an underlying network. 
The yeast wildtype network is not optimized for knockout resilience, given the 
sequence of activation patterns. There are networks with significantly larger 
knockout resilience implementing the same sequence. 

Finally, we stress that our definition of resilience checks whether a node is 
dispensable for integrating and transmitting information only in the context of 
the regulatory network we consider. This regulatory network is far from being a 
complete description of cell function. In reality, therefore, the considered node 
may be involved in other functions and indispensable for survival thereby further 
reducing the knockout resilience. 
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